Analisis Klasifikasi SMS Spam Menggunakan Logistic Regression

نویسندگان

چکیده

SMS or Short Message Service is usually found on cell phones. divided into two categories, namely spam and non-spam (ham). Spam an that annoying to phone users because it tends contain messages are not important such as promos scams. Meanwhile, (ham) tend SMS, from previous users. In this study, the classification of was carried out using logistic regression method. The purpose study distinguish classify between dataset in amounted 1143 data, there columns, text column label column. number for 566 577. proposed method gets a better accuracy 95%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Filtering Network Spam Message using Approximated Logistic Regression

The development of telecom network and Internet provides effective ways for communication. As an important way in communication, Short Messaging Service (SMS) via both telecom network and Internet has played an increasing important role in daily life. However, it usually suffers from spam SMS that causes misunderstanding and cheat. The highly varying content, network environment make the identi...

متن کامل

Identifying the Pertinent Features of SMS Spam

Mobile SMS spam is on the rise and is a prevalent problem. While recent work has shown that simple machine learning techniques can distinguish between ham and spam with high accuracy, this paper explores the individual contributions of various textual features in the classification process. Our results reveal the surprising finding that simple is better: using the largest spam corpus of which w...

متن کامل

SMS Spam Detection using Machine Learning Approach

Over recent years, as the popularity of mobile phone devices has increased, Short Message Service (SMS) has grown into a multi-billion dollars industry. At the same time, reduction in the cost of messaging services has resulted in growth in unsolicited commercial advertisements (spams) being sent to mobile phones. In parts of Asia, up to 30% of text messages were spam in 2012. Lack of real data...

متن کامل

SMS spam filtering: Methods and data

Mobile or SMS spam is a real and growing problem primarily due to the availability of very cheap bulk pre-pay SMS packages and the fact that SMS engenders higher response rates as it is a trusted and personal service. SMS spam filtering is a relatively new task which inherits many issues and solutions from email spam filtering. However it poses its own specific challenges. This paper motivates ...

متن کامل

cumulative logistic regression vs ordinary logistic regression

The common practice of collapsing inherently continuous or ordinal variables into two categories causes information loss that may potentially weaken power to detect effects of explanatory variables and result in Type II errors in statistical inference. The purpose of this investigation was to illustrate, using a substantive example, the potential increase in power gained from an ordinal instead...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jurnal Sistem Cerdas

سال: 2021

ISSN: ['2622-8254']

DOI: https://doi.org/10.37396/jsc.v4i3.166